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INTRODUCTION 


Amidst  the  proposals  of  the  late  fifties  (1958)  a  novel  and  exciting  idea  was 
advanced  by  Hans  Peter  Luhn  of  IBM.  The  idea  he  proposed,  among  others,  was 
to  exploit  the  capabilities  of  computers  to  provide  the  users  of  literature 
with  tailor-made  announcements  about  recent  articles  in  their  individual 
spheres  of  interest  (1). 

In  the  system  envisioned  by  Luhn  the  computer  uld  compare  lists  of  technical 
expressions  describing  users  interests  with  the  technical  terms  in  the 
abstracts  of  scientific  articles.  As  matches  between  words  would  occur, 
computers  would  prepare  lists  of  names  of  the  people  who  were  likely  to  be 
interested  in  the  selected  documents.  The  users  would  be  notified  about 
such  articles  and.  asked  to  indicate  on  a  response  card  the  accuracy  of  the 
selection.  The  resultant  information  would  be  fed  back  to  the  system  to 
adjust  the  use."  ‘profile"  to  improve  the  accuracy  of  the  future  computer 
selections. 

it  is  common  knowledge  that  Luhn's  suggestion  has  been  tried  and  implemented 
in  many  places.  The  questions  that  need  to  be  asked  is:  How?  To  what 
extent?  With  what  degree  of  success?  Cost  vs  benefit? 

The  documentation  literature  in  which  the  SUI  operators  report  or,  their  systems 
provides  part  of  the  arswer.  Another  part  is  now  available  from  two  surwys. 
One  survey  involved  V  SOI  systems.  The  study  served  as  the  background  for 
the  design  of  SOI  services  in  technical  libraries  of  the  d.  S,  Amy.  It  was 
completed  in  1966  (2),  More  recently  another  inquiry  was  initiated  by  the 


-2- 


Office  of  Aerospace  Research  of  cne  U.  S.  Air  Force,  to  learn  more  about  the 
various  SDI  input-output  options,  with  particular  emphasis  on  the  economy  of 
operation.  This  later  study  is  still  in  progress.  To  date  39  systems  have 
responded.  Of  these  responses  25  have  beer,  analyzed.  The  facts  which  have 
emerged  from  this  inquiry,  coupled  with  the  findings  of  ethers,  are  presented 
in  this  paper. 


GENERAL  OVERVIEW 

We  would  like  to  start  the  general  overview  of  these  systems  by  a  few  remarks 
about  SOI  as  it  should  be  viewed  in  the  context  of  information-acquiring  habits 
of  scientists  and  engineers.  Imagine,  if  you  will,  the  bull's  eye  of  a  target. 
The  area  in  the  center  of  the  bull's  eye  represents  the  information  the 
scientist  regularly  receives  from  his  associates  and  the  journals  he  subscribes 
to.  As  the  circles  widen  out,  the  probability  of  coming  across  relevant  infor¬ 
mation  becomes  increasingly  smaller.  Items  found  in  obscure  journals,  foreign 
language  publications,  or  reports  of  smaller  industrial  R&D  establishments 
usually  fall  well  outside  of  the  bull's  eye  and  thus  escape  the  user's  immediate 
attention.  This  is  where  SDI  offers  ' is  greatest  help  -  by  increasing  the 
user's  peripheral  vision,  as  it  were. 

With  these  preliminary  remarks  out  of  the  way  we  want  to  move  to  some  obser¬ 
vations  about  the  present  SDI  systems. 

•  While  it  may  sound  strange,  SDI  has  been,  and  still  is  very  much 
the  American  information  technique.  As  far  as  we  know  (and  we 
could  be  mistaken)  the  only  countries  that  have  invested  in  SDI 


based  systems  are  the  U.S.  and  U.K.  Our  search  through  literature 
and  interviews  has  failed  to  uncover  a  single  operating  SDI  system 
in  any  other  country. 

•  If  the  various  sources  of  literature  are  to  be  believed,  we  should 
have  more  than  70  systems  in  operation.  However,  we  have  identified 
and  verified  only  39  systems.  There  are  still  several  to  be  examined; 
however,  we  do  not  expect  the  total  to  be  much  above  45.  These  45 
systems  serve  an  approximate  population  of  30,000  users.  Many  of 
these  users  are  served  through  group  profiles,  hence  the  actual 
number  of  profiles  that  are  being  processed  through  the  SDI  systems 

is  somewhat  lower.  Together  the  SDI  systems  process  some  100,000 
individual  items  of  literature  each  month, 

•  As  many  as  58%  of  the  systems  are  viewed  by  their  designers  or 
operators  as  fully  operational;  13%  as  partially  operational. 

Only  29%  are  considered  experimental,  i.e.,  in  a  stage  of  develop¬ 
ment  where  their  feasibility  and  utility  is  still  under  serious 
questioning. 

•  The  average  age  of  a  fully  operational  system  is  3,5  years;  the 
oldest  of  these  is  now  6.3  years  old.  The  average  age  of  the 
experimental  system  is  less  than  one  year. 

«  Significantly,  practically  all  of  the  systems  have  been  established 
by  such  institutions  as  government  agencies,  large  industrial 
organizations  and  large  information  services.  Our  sample  revealed 
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only  one  SOI  system  set  up  by  a  university  at  the  initiative  of 
its  library. 

•  Almost  one-half  of  the  systems  (42%)  depend  totally  on  the  prepara¬ 
tion  of  their  inputs  by  the  major  information  systems  of  the 
National  Aeronautics  and  Space  Administration,  Atomic  Energy 
Commission,  National  Library  of  Medicine,  Chemical  Abstracts  and 
the  Institute  of  Scientific  Information 

•  92%  of  the  systems  use  computers  to  perform  the  profile-document 
matching  operations.  The  equipment  used  so  far  ranges  all  the 
way  from  the  IBM  140!  to  the  IBM  360. 

9  Only  one  system  in  the  ..-ample  of  25  has  been  designed  solely  for 
technical  managers;  four  (16%)  are  used  exclusively  by  scientists. 
The  remaining  SO?*  are  used  to  provide  services  to  both  scientists 
and  technical  managers 

•  There  are  wide  differences  between  the  systems  with  respect  to 
number  of  document  entries  they  process.  The  largest  system 
claims  to  process  as  many  as  30,000  entries  each  month,  while 
the  smallest  runs  no  more  than  150.  Within  these  boundaries 
it  appears  that  there  are  two  major  groupings.  About  60%  of 
the  systems  process  anywhere  between  150  -  1,200  documents 
per  month  -  while  the  other  40%  process  anywhere  between 
1,200  and  300,000.  As  might  be  expected  the  large  processors 

'  are  AEG,  NASA,  and  the  Department  of  Defense, 
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•  The  present  systems  operate  on  profiles  of  single  individuals 

as  well  as  groups.  Forty  (40%)  of  the  systems  serve  individuals 
exclusively.  Another  40%  processes  both  the  individual  and  group 
profiles.  Only  20%  are  designed  to  serve  the  needs  of  groups  alone. 

•  Looking  at  the  character  of  the  groups  served  by  the  SDI  systems 

we  find  that  they  fall  into  three  categories:  administrative 
groupings  such  as  whole  companies;  the  task/mission  oriented 
groups  which  cover  a  wide  range  of  disciplines:  and  the  science 
oriented  groups  whose  interest  usually  centers  in  narrower  and 
more  easily  identifiable  segments  of  literature.  The  breakout 
within  our  sample  is  as  follows:  by  administrative  grouping  -  21%, 
by  task  orientation  -  5 / % ,  by  discipline  -  57%.  The  reason  these 

figures  add  to  more  than  100%  is  because  some  of  the  systems 

provide  services  to  more  than  one  group. 

•  Only  about  25%  of  the  systems  use  profiles  developed  through  the 

free  selection  of  terms  without  reference  to  a  thesaurus,  i he 
others  require  selection  of  terms  from  a  controlled  vocabulary . 

SPECIAL  OBSERVATIONS 

•  The  Costs  are  Appreciable  but  Not  Unreasonable 

Reliable  costs  of  $nI  services  are  difficult  t.o  obtain,  partly 
because  the  usual  SOI  service  is  a  part  of  the  larger  services 
provided  by  a  system  and  partly  because  of  the  lack  of  standard 
definition.  However,  there  are  some  indicators  that  the 
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apnroximate  average  cost  of  serving  a  single  user  is  about  $100  per 
year.  For  example  the  estimated  cost  of  operating  a  service 
using  the  SDI-3  program  on  an  IBM  1401  system,  processing  1,200 
documents  against  500  profiles  is  $92.50  per  profile.  (3:31) 

The  cost  of  the  selective  dissemination  system  of  the  Institute 
of  Scientific  Information  can  be  obtained  at  a  flat  basic  rate 
of  $100.00  per  user  per  year.  Similarly,  the  cost  of  providing 
service  to  some  200  Air  Force  users  enrolled  in  the  NASA  SDI 
system  amounted  in  1966  to  approximately  $80.00  per  user. 

•  Admittedly  these  are  relatively  high  costs,  but  not  unreasonable, 
if  one  considers  that  the  SOI  service  is  capable  of  saving  about 
1%  of  the  technical  man's  time.  (3:33)  Assuming  an  average  cost 
of  employing  a  technical  man  to  be  $15,000  per  year  --  this 
represents  a  saving  of  $150.00;  in  our  opinion  a  fair  return  on 
an  investment  of  $80  to  $100.  The  use  of  group  profiles  would, 
of  course,  make  these  savings  more  impressive.  But  the  really 
major  savings  are  not  represented  by  the  time  saved.  Rather, 
they  are  in  providing  the  technical  man  with  what  we  have 
earlier  called  peripheral  vision,  i.e.,  the  ability  of 
learning  about  information  which  could  be  essential  to  his 
work,  but  which  is  published  in  the  literature  he  ncnnally 
would  not  consider  as  likely  to  contain  anything  of  interest. 

In  this  connection  there  is  some  evidence  that  certain  users 
see  very  little  of  this  peripheral  information.  The  measure¬ 
ment  of  engineers  at  one  installation  exposed  to  an  SOI 


l 

i 

E 


[ 


-7- 


system  showed  that  only  10%  of  the  material  supplied  by  the 
SDI  system  had  been  seen  before  (8). 

•  User  Satisfaction  is  High 

The  measure  of  user  satisfaction  with  SDI  is  an  elusive  concept  - 
measured  mostly  on  subjective  terms.  Our  own  earlier  study  in 
1966  indicates,  however,  that  it  is  unusually  high.  Approximately 
96%  of  our  respondents  have  indicated  that  they  consider  the 
service  either  useful  or  highly  useful.  They  back  up  this  opinion 
with  an  indication  of  their  willingness  to  allocate  a  portion  of 
their  budget  to  continue  the  SDI  Services.  (3:33) 

«  Satisfaction  Correlates  Positively  with  the  Volume  of  Notifications  - 
Rather  than  with  the  Precision  of  Matching  Profiles 

Contrary  to  the  usual  belief,  no  correlation  has  been  determined 
between  the  degree  of  satisfaction  (expressed  in  terms  of  time 
saved  and  expansion  of  coverage)  and  the  accuracy  of  the  matching. 

In  one  study  users  with  the  matching  precision  percentage  as  low 
as  25%  wanted  to  continue  the  service  because  it  expanded  their 
literature  searching  abiiity  and  saved  them  tim».  Conversely, 
users  who  were  not  completely  satisfied  with  the  system  were 
receiving  notices  with  the  matching  precision  of  47%  and  higher.  (4) 
Thus  user  satisfaction  with  the  degree  of  precision  In  matching 
his  individual  interest  profile  depends  mainly  on  his  personal 
search  preference  --  whether  he  will  accept  less  precision  in 
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matching  his  profile  in  return  for  greats  coverage.  We  suspect 
that  this  is  largely  a  question  of  trade-off;  how  much  of  his  own 

I,. 

time  i'S  the  individual  user  willing  to  invest  in  final  screening 
of  the  documents  in  order  to  broaden  his  coverage?  For  example, 
starting  with  a  thousand  documents  to  be  screened  by  SDI 
techniques,  three  different  users  may  indicate  three  rather 
divergent  preferences 


Number  of  Documents 
\of  the  thousand] 

Screening 

Precision 


User 

Received 

Relevant 

Wnn-Relevant 

0/ 

/o 

A 

3 

3 

0 

100% 

B 

30 

15 

15 

50% 

C 

100 

30 

70 

30% 

In  the  same  study  the  author  shows  that  there  is  a  positive 
correlation  between  the  degree  of  satisfaction  and  the 
literature  coverage  provided  by  the  5DI  service. 

Similarly,  in  a  study  of  45  IBM  research  scientists  Resnick 
and  Hensley  (5:109)  found  that  users  do  not  want  to  place  a 
limit  on  the  number  of  notifications  received.  30%  of  them 
indicated  that  the  number  of  notifications  received  during  the 
experiment  was  too  high,  but  only  15%  of  subjects  indicated  that 
thi"  was  one  of  the  features  they  did  not  like. 
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•  SDI  Applications 

By  far  the  largest  area  of  application  is  in  providing  current 
awareness  to  scientists  and  engineers.  There  are  other 
applications,  however,  some  of  which  offer  immediate  and 
tangible  benefits  to  the  organization. 

One  of  the  earliest  such  applications  was  made  by  IBM  tc  pro¬ 
vide  information  service  to  marketing  personnel  rather  than 
to  research  and  development  people.  K^aft  describes  such  a 
system  used  by  the  IBM  Midwestern  Region  based  in  Chicago.  (61 
The  input  publications  consist  of  such  items  as  the  Wall  Street 
Journal ,  Fortune,  Sales  Management,  Control  Engineering,  etc. 

In  another  case  an  aerospace  company  is  looking  into  the 
possibility  of  using  SDI  to  disseminate  company  reports  to 
technical  managers  in  government.  Presumably  greater  know¬ 
ledge  on  the  part  of  government  persunr.el  about  the  competence 
and  products  of  a  given  company  might  improve  the  chances  for 
successful  competition  on  government  contracts. 

A  ncvel  use  of  SDI,  and  in  our  opinion  one  of  far-reaching 
significance,  is  its  recent  application  to  the  control  of  library 
acquisitions.  Mr.  Carlos  Segarra,  of  the  Army  Engineer  Research 
and  Development  Laboratories,  is  using  profiles  of  laboratory 
projects,  instead  of  people,  to  reflect  the  information  needed 
to  support  the  particular  project  or  task  (7).  The  resultant 
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profile  for  the  Center  consists  of  approximately  3,000  terms, 
updated  at  6  months1  intervals.  The  profiles  act  as  a  filter 
for  the  incoming  documentation,  rejecting  those  documents  from 
further  processing  into  the  library's  permanent  collection 
which  could  not  be  matched  up  with  the  Center's  interest.  Segarra 
claims  to  have  achieved  savings  of  $3.60  per  processed  document 
(reduction  from  $7.66  to  $4.06  per  report).  Additionally  this 
technique  reduced  processing  time  from  75  days  to  5  days  and 
significantly  increased  the  use  of  holdings. 

OBSERVABLE  TRENDS 


Conclusive  data  is  generally  lacking  to  establish  undisputable 
trends.  Even  so,  certain  actions  on  the  part  of  the  SDI  operators 
suggest  forces  at  work  in  several  directions. 

*  Efforts  to  Capitalize  on  the  Products  of  Large  Systems 

The  high  costs  of  identifying,  organizing,  indexing,  and  keyboarding 
information  about  the  documents,  coupled  with  the  evergrowing  demand 
for  coverage  of  a  wide  subject  area  are  the  obvious  motives  for  the 
efforts  of  maximum  utilization  of  tapes  and  indexes  by  large  systems. 
SDI  systems  which  take  inputs  from  several  such  systems  arc  now  in 
evidence,  and  are  expected  to  grow. 

•  Increased  Use  of  Group  Profiles 


Another  significant  development  is  the  trend  away  from  individual 
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profiles  to  group  and  standard  profiles.  The  evidence  of  this 
trend  is  present  in  the  recent  NASA  emphasis  on  ^heir  new  SCAN 
system,  which  provides  information  on  some  189  areas,  covering 
the  34  subject  categories  used  in  STAR  (Scientific  and  Technical 
Aerospace  Reports)  and  1AA  (International  Aerospace  Abstracts). 
It  is  also  shown  by  the  customer  preference  at  the  Aerospace 
Research  Application  Center  (ARAC),  a  self-supporting  private 
SOI  service  which  disseminates  government-generated  information 
to  industry.  When  given  the  option  to  subscribe  either  to 
individual  profiles  at  a  cost  of  some  $200  for  up  to  twenty 
notifications,  or  to  standard  profiles  at  $80  per  year,  70% 
of  them  elected  standard  group  profiles.  The  trend  is  being 
further  established  by  the  recent  experiments  conducted  jointly 
by  the  Air  Force  Office  of  Aerospace  Research  and  the  Clearing¬ 
house  for  Federal  Scientific  and  Technical  Information.  The 
basic  method  used  in  this  experiment  is  to  break  out  the 
Clearinghouse  announcement  bulletin  into  some  65  categories  to 
permit  receipt  of  only  those  announcements  which  fall  within 
the  users  area  of  interest.  This  experimental  announcement 
program  known  as  CAST  (Clearinghouse  Announcement  in  Science 
and  Technology)  oegan  in  September  1967.  950  Air  Force 

scientists  and  engineers  were  included.  If  proven  successful 
it  may  be  opened  to  general  subscription. 


k 
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•  The  Trend  to  Commercial  SDI  Subscription  Services 

The  trend,  which  we  believe  is  of  greatest  significance,  is  toward 
the  emergence  of  private  services  willing  to  provide  selective 
announcements  to  anyone  who  is  prepared  to  pay  the  service  fee. 
Perhaps  the  best  known  is  the  ASCA  (Automatic  Subject  Citation 
Alert),  an  SDI  service  of  the  Institute  for  Scientific  Informa¬ 
tion  ( I  SI )  in  Philadelphia.  ISI  has  been  testing  and  operating 
the  SDI  system  for  the  past  three  years.  The  test  involved  more 
than  500  scientists.  The  system  covers  annually  about  300,000 
articles  in  some  1,600  journals.  The  basic  annual  subscription 
costs  $100.00, 

Numerous  other  centers,  especially  those  established  under  the 
auspices  of  NASA  for  the  purpose  of  promoting  transfer  of  space 
age  advances  to  the  private  industrial  sector,  provide  SDI  services 
to  the  companies  and  to  private  individuals.  ARAC,  mentioned 
earlier,  is  one  of  such  centers  which  provides  a  commercial 
service  at  a  cost  of  $80/per  standard  profile.  Recent  efforts 
by  the  Chemical  Abstracts  Service  to  provide  their  tapes  for  SDI 
exploitation  by  major  companies  lends  further  indication  of  a 
trend  which  is  most,  encouraging. 

CONCLUSIONS 

«  Considering  the  fact  that  there  has  been  a  steady  and  rapid  growth 
of  SDI  systems;  that  many  ot  these  systems  have  been  in  existence 
for  a  number  of  years;  that  there  has  been  a  corresponding  growth 
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of  SDI  users;  and  that  GDI  is  now  made  available  through 
subscription  from  commercial  sources,  SDI  appears  to  be  becoming 
accepted  as  an  effective  and  economical  method  of  assisting 
scientists  and  engineers  in  keeping  abreast  of  advances  in 
their  fields. 

•  As  evidenced  by  the  respondents,  who  estimate  that  90%  of  the 
material  they  receive  from  SDI  was  information  they  had  not 
previously  seen,  SDI  is  especially  valuable  in  providing  the 
user  with  peripheral  vision  of  information  which  is  of  direct 
interest  to  him. 

«  The  relatively  large  computer  and  other  costs  of  installing  and 
operating  SDI  systems  tend  to  inhibit  any  but  the  larger 
organisations  from  establishing  SDI  services.  The  availability 
of  free  computer  time  for  initial  experimentation  is  usually  an 
essential  springboard  for  the  start  of  the  service. 

t  SDI  systems  appear  to  be  developing  in  two  major  directions: 

(1)  greater  use  of  machinable  records,  produced  by  large 
documentation  systems,  and  (2)  experimentation  with  group 
profiles,  doth  trends  promise  significant  reductions  of 
operating  costs. 

«  An  encouraging  trend  also  is  found  in  the  availability  of  SDI 
services  through  the  commercial  subscriptions  from  such 
organizations  as  Chemical  Abstracts  or  I S I .  This  promises  to 
bring  SDI  within  the  range  of  the  individual  or  small  organization 
anywhere  in  the  world. 
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t  Both  the  availability  of  computer  tapes  and  direct  SDI  services 
should  permit  greater  use  of  SDI  by  countries  who  may  not  as  yet 
have  resources  or  facilities  to  develop  and  build  independent 
services  of  their  own. 
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